GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning

Neural Information Processing Systems

Large Language Models (LLMs) are increasingly used for various tasks with graph structures. Though LLMs can process graph information in a textual format, they overlook the rich vision modality, which is an intuitive way for humans to comprehend structural information and conduct general graph reasoning. The potential benefits and capabilities of representing graph structures as visual images (i.e., visual graphs) are still unexplored. To fill this gap, we propose an end-to-end framework called Graph to vIsual and Textual IntegrAtion (GITA), which is the first to incorporate visual graphs into general graph reasoning. Extensive experiments on the GVLQA dataset and five real-world datasets show that GITA outperforms mainstream LLMs in general graph reasoning capabilities.
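As a rough illustration of the "textual format" for graphs that the abstracts above contrast with visual graphs, the sketch below serializes an edge list into a plain-text description an LLM could consume. The function name and the exact phrasing are illustrative assumptions, not the format used by GITA or GITQA.

```python
def describe_graph(edges):
    """Serialize an edge list into a plain-text graph description
    suitable for an LLM prompt (illustrative format, not GITA's)."""
    nodes = sorted({n for edge in edges for n in edge})
    lines = [f"The graph has {len(nodes)} nodes: "
             f"{', '.join(str(n) for n in nodes)}."]
    for u, v in edges:
        lines.append(f"Node {u} is connected to node {v}.")
    return "\n".join(lines)

# Example: a small path graph 0-1-2
print(describe_graph([(0, 1), (1, 2)]))
```

A vision-language pipeline like the ones described above would pair such a description with a rendered image of the same graph, then feed both modalities to the model.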


Benchmarking and Improving Large Vision-Language Models for Fundamental Visual Graph Understanding and Reasoning

Zhu, Yingjie, Bai, Xuefeng, Chen, Kehai, Xiang, Yang, Zhang, Min

arXiv.org Artificial Intelligence

Large Vision-Language Models (LVLMs) have demonstrated remarkable performance across diverse tasks. Despite great success, recent studies show that LVLMs encounter substantial limitations when engaging with visual graphs. To study the reason behind these limitations, we propose VGCure, a comprehensive benchmark covering 22 tasks for examining the fundamental graph understanding and reasoning capacities of LVLMs. Extensive evaluations conducted on 14 LVLMs reveal that LVLMs are weak in basic graph understanding and reasoning tasks, particularly those concerning relational or structurally complex information. Based on this observation, we propose a structure-aware fine-tuning framework to enhance LVLMs with structure learning abilities through 3 self-supervised learning tasks. Experiments validate the effectiveness of our method in improving LVLMs' zero-shot performance on fundamental graph learning tasks, as well as enhancing the robustness of LVLMs against complex visual graphs.


Rendering Graphs for Graph Reasoning in Multimodal Large Language Models

Wei, Yanbin, Fu, Shuai, Jiang, Weisen, Kwok, James T., Zhang, Yu

arXiv.org Artificial Intelligence

Large Language Models (LLMs) are increasingly used for various tasks with graph structures, such as robotic planning, knowledge graph completion, and common-sense reasoning. Though LLMs can comprehend graph information in a textual format, they overlook the rich visual modality, which is an intuitive way for humans to comprehend structural information and conduct graph reasoning. The potential benefits and capabilities of representing graph structures as visual images (i.e., visual graphs) are still unexplored. In this paper, we take the first step in incorporating visual information into graph reasoning tasks and propose a new benchmark, GITQA, where each sample is a tuple (graph, image, textual description). We conduct extensive experiments on the GITQA benchmark using state-of-the-art multimodal LLMs. Results on graph reasoning tasks show that combining textual and visual information performs better than using either modality alone. Moreover, the LLaVA-7B/13B models finetuned on the training set achieve higher accuracy than the closed-source model GPT-4(V). We also study the effects of augmentations in graph reasoning.


SCOPE: Structural Continuity Preservation for Medical Image Segmentation

Yeganeh, Yousef, Farshad, Azade, Guevercin, Goktug, Abu-zer, Amr, Xiao, Rui, Tang, Yongjian, Adeli, Ehsan, Navab, Nassir

arXiv.org Artificial Intelligence

Although the preservation of shape continuity and physiological anatomy is a natural assumption in the segmentation of medical images, it is often neglected by deep learning methods, which mostly model input data statistically as pixels rather than as interconnected structures. In biological structures, however, organs are not separate entities; a severed vessel, for example, indicates an underlying problem. Traditional segmentation models are not designed to strictly enforce the continuity of anatomy, potentially leading to inaccurate medical diagnoses. To address this issue, we propose a graph-based approach that enforces the continuity and connectivity of anatomical topology in medical images. Our method encodes the continuity of shapes as a graph constraint, ensuring that the network's predictions maintain this continuity. We evaluate our method on two public retinal vessel segmentation benchmarks, showing significant improvements in connectivity metrics over traditional methods while achieving better or on-par performance on segmentation metrics.
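To make the notion of a "connectivity metric" concrete, the sketch below counts 4-connected foreground components in a binary mask: a severed vessel shows up as extra components. This is a minimal stand-in for illustration only, not the specific metric or graph constraint used in the SCOPE paper.

```python
from collections import deque

def count_components(mask):
    """Count 4-connected foreground components in a binary mask
    (list of rows of 0/1). A broken vessel yields extra components."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    components = 0
    for i in range(h):
        for j in range(w):
            if mask[i][j] and not seen[i][j]:
                components += 1
                # Breadth-first flood fill over this component
                queue = deque([(i, j)])
                seen[i][j] = True
                while queue:
                    x, y = queue.popleft()
                    for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                        nx, ny = x + dx, y + dy
                        if (0 <= nx < h and 0 <= ny < w
                                and mask[nx][ny] and not seen[nx][ny]):
                            seen[nx][ny] = True
                            queue.append((nx, ny))
    return components

# A "severed vessel": one pixel line broken in the middle -> 2 components
print(count_components([[1, 1, 0, 1, 1]]))  # prints 2
```

A topology-aware model would be penalized when its prediction has more components than the ground-truth anatomy, which is the intuition behind enforcing continuity as a constraint.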


8 Tools Every Data Scientist Should Use

#artificialintelligence

I explain Artificial Intelligence terms and news to non-experts. Two years ago, I saw my first research paper ever. I remember how old it looked and how discouraging the mathematics inside was. It really did look like something researchers work on in movies. To be fair, the paper was from the 1950s, but the format hasn't changed much since then.